Overview

Dataset Statistics

Number of Variables 14
Number of Rows 10000
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 2.6 MB
Average Row Size in Memory 276.4 B
Variable Types
  • Numerical: 7
  • Categorical: 6
  • Geography: 1
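
The two memory figures above agree with each other if the report's "MB" is read as 2^20 bytes. That reading is an assumption, since the report does not state its units. A minimal arithmetic check under that assumption:

```python
# Figures copied from the Dataset Statistics table above.
n_rows = 10_000
avg_row_bytes = 276.4

# Assumption: "MB" in the report denotes 2**20 bytes (MiB).
total_bytes = n_rows * avg_row_bytes
total_mib = total_bytes / 2**20

print(f"{total_bytes:,.0f} B is about {total_mib:.1f} MiB")
```

With decimal megabytes (10^6 bytes) the same product would round to 2.8 rather than 2.6, which is why the binary reading seems the more likely one.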

Dataset Insights

  • RowNumber is uniformly distributed (Uniform)
  • Balance is skewed (Skewed)
  • Surname has a high cardinality: 2932 distinct values (High Cardinality)
  • NumOfProducts has constant length 1 (Constant Length)
  • HasCrCard has constant length 1 (Constant Length)
  • IsActiveMember has constant length 1 (Constant Length)
  • Exited has constant length 1 (Constant Length)
  • Balance has 3617 (36.17%) zeros (Zeros)

Variables


RowNumber

numerical

Approximate Distinct Count 10000
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 5000.5
Minimum 1
Maximum 10000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • RowNumber is uniformly distributed

Quantile Statistics

Minimum 1
5-th Percentile 500.95
Q1 2500.75
Median 5000.5
Q3 7500.25
95-th Percentile 9500.05
Maximum 10000
Range 9999
IQR 4999.5

Descriptive Statistics

Mean 5000.5
Standard Deviation 2886.8957
Variance 8.3342×10⁶
Sum 5.0005×10⁷
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.5773
  • RowNumber is not normally distributed (p-value 1.2892303425383614e-88)
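
The RowNumber figures can be reproduced by hand, which also shows which conventions the report appears to use. The sketch below assumes RowNumber is simply the integers 1 to 10000 (consistent with the reported minimum, maximum, and distinct count), that percentiles use linear interpolation between order statistics, and that the standard deviation uses the sample (n - 1) denominator. These are inferences from the numbers, not documented behaviour of the reporting tool.

```python
import statistics

# Assumption: RowNumber is the integers 1..10000.
row_number = list(range(1, 10_001))

# method="inclusive" interpolates linearly between order statistics,
# treating the sample minimum and maximum as the 0th and 100th percentiles.
q1, median, q3 = statistics.quantiles(row_number, n=4, method="inclusive")
twentieths = statistics.quantiles(row_number, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

mean = statistics.mean(row_number)
std = statistics.stdev(row_number)          # sample standard deviation (n - 1)
variance = statistics.variance(row_number)  # sample variance (n - 1)
cv = std / mean                             # coefficient of variation

print("5th, Q1, median, Q3, 95th:", p5, q1, median, q3, p95)
print("IQR:", q3 - q1)
print("std:", round(std, 4), "variance:", f"{variance:.4e}")
print("coefficient of variation:", round(cv, 4))
```

Under these assumptions the values line up with the Quantile Statistics and Descriptive Statistics tables above. A population (n) denominator would give a slightly smaller standard deviation than the one reported.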

CustomerId

numerical

Approximate Distinct Count 10000
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 1.5691×10⁷
Minimum 1.5566×10⁷
Maximum 1.5816×10⁷
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CustomerId is skewed right (γ1 = 0.0011)

Quantile Statistics

Minimum 1.5566×10⁷
5-th Percentile 1.5579×10⁷
Q1 1.5629×10⁷
Median 1.5691×10⁷
Q3 1.5753×10⁷
95-th Percentile 1.5803×10⁷
Maximum 1.5816×10⁷
Range 249989
IQR 124705.5

Descriptive Statistics

Mean 1.5691×10⁷
Standard Deviation 71936.1861
Variance 5.1748×10⁹
Sum 1.5691×10¹¹
Skewness 0.001149
Kurtosis -1.1961
Coefficient of Variation 0.004585

Surname

categorical

Approximate Distinct Count 2932
Approximate Unique (%) 29.3%
Missing 0
Missing (%) 0.0%
Memory Size 697.6 KB

Length

Mean 6.4349
Standard Deviation 2.2739
Median 6
Minimum 2
Maximum 23

Sample

1st row Hargrave
2nd row Hill
3rd row Onio
4th row Boni
5th row Mitchell

Letter

Count 63946
Lowercase Letter 53647
Space Separator 55
Uppercase Letter 10299
Dash Punctuation 19
Decimal Number 0
  • Surname contains many words: 2926 words
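
The row labels in the Letter table (Lowercase Letter, Uppercase Letter, Space Separator, Dash Punctuation, Decimal Number) match the names of Unicode general categories, and the reported Count equals the lowercase and uppercase totals added together (53647 + 10299 = 63946). A minimal sketch of such a tally, assuming the report classifies characters by Unicode category (not confirmed by the source) and using only the five sample surnames shown above:

```python
import unicodedata
from collections import Counter

# Sample surnames copied from the "Sample" rows above.
surnames = ["Hargrave", "Hill", "Onio", "Boni", "Mitchell"]

# Map Unicode general-category codes to the labels used in the report.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}

counts = Counter(
    LABELS.get(unicodedata.category(ch), "Other")
    for name in surnames
    for ch in name
)

# "Count" in the report appears to be the total number of letters.
letter_count = counts["Lowercase Letter"] + counts["Uppercase Letter"]
print(dict(counts), "letters:", letter_count)
```

Characters outside the five listed categories, such as apostrophes, fall into the "Other" bucket here. The report does not show such a row, so that bucket is purely illustrative.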

CreditScore

numerical

Approximate Distinct Count 460
Approximate Unique (%) 4.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 650.5288
Minimum 350
Maximum 850
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CreditScore is skewed left (γ1 = -0.0716)

Quantile Statistics

Minimum 350
5-th Percentile 489
Q1 584
Median 652
Q3 718
95-th Percentile 812
Maximum 850
Range 500
IQR 134

Descriptive Statistics

Mean 650.5288
Standard Deviation 96.6533
Variance 9341.8602
Sum 6.5053×10⁶
Skewness -0.0716
Kurtosis -0.4261
Coefficient of Variation 0.1486
  • CreditScore is not normally distributed (p-value 3.890217360522603e-06)
  • CreditScore has 15 outliers

Geography

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 693.4 KB
  • The largest value (France) is over 2.0 times larger than the second largest value (Germany)

Length

Mean 6.0032
Standard Deviation 0.7061
Median 6
Minimum 5
Maximum 7

Sample

1st row France
2nd row Spain
3rd row France
4th row France
5th row Spain

Letter

Count 60032
Lowercase Letter 50032
Space Separator 0
Uppercase Letter 10000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (France, Germany) take over 50.0%

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 682.7 KB

Length

Mean 4.9086
Standard Deviation 0.9959
Median 4
Minimum 4
Maximum 6

Sample

1st row Female
2nd row Female
3rd row Female
4th row Female
5th row Female

Letter

Count 49086
Lowercase Letter 39086
Space Separator 0
Uppercase Letter 10000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Male, Female) take over 50.0%

Age

numerical

Approximate Distinct Count 70
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 38.9218
Minimum 18
Maximum 92
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Age is skewed right (γ1 = 1.0112)

Quantile Statistics

Minimum 18
5-th Percentile 25
Q1 32
Median 37
Q3 44
95-th Percentile 60
Maximum 92
Range 74
IQR 12

Descriptive Statistics

Mean 38.9218
Standard Deviation 10.4878
Variance 109.9941
Sum 389218
Skewness 1.0112
Kurtosis 1.394
Coefficient of Variation 0.2695
  • Age is not normally distributed (p-value 8.381954731261811e-05)
  • Age has 359 outliers
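
The report does not say how it counts outliers for CreditScore and Age. A common convention is Tukey's rule, which flags values more than 1.5 × IQR beyond the quartiles. The sketch below computes those fences from the reported quartiles. Both the rule and the helper name `tukey_fences` are assumptions for illustration, not something the source confirms.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences placed k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Quartiles copied from the Quantile Statistics tables above.
credit_fences = tukey_fences(584, 718)  # CreditScore: Q1 = 584, Q3 = 718
age_fences = tukey_fences(32, 44)       # Age: Q1 = 32, Q3 = 44

print("CreditScore fences:", credit_fences)
print("Age fences:", age_fences)
```

If this rule is the one in use, every CreditScore outlier would lie at the low end, because the reported maximum of 850 is below the upper fence, and every Age outlier would lie at the high end, because the reported minimum of 18 is above the lower fence.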

Tenure

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 5.0128
Minimum 0
Maximum 10
Zeros 413
Zeros (%) 4.1%
Negatives 0
Negatives (%) 0.0%
  • Tenure is skewed right (γ1 = 0.011)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 2
Median 5
Q3 7
95-th Percentile 9
Maximum 10
Range 10
IQR 5

Descriptive Statistics

Mean 5.0128
Standard Deviation 2.8922
Variance 8.3647
Sum 50128
Skewness 0.01099
Kurtosis -1.1652
Coefficient of Variation 0.577
  • Tenure is not normally distributed (p-value 0.0003022274726441247)

Balance

numerical

Approximate Distinct Count 6382
Approximate Unique (%) 63.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 76485.8893
Minimum 0
Maximum 250898.09
Zeros 3617
Zeros (%) 36.2%
Negatives 0
Negatives (%) 0.0%
  • Balance is skewed left (γ1 = -0.1411)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 97198.54
Q3 127644.24
95-th Percentile 162711.669
Maximum 250898.09
Range 250898.09
IQR 127644.24

Descriptive Statistics

Mean 76485.8893
Standard Deviation 62397.4052
Variance 3.8934×10⁹
Sum 7.6486×10⁸
Skewness -0.1411
Kurtosis -1.4893
Coefficient of Variation 0.8158
  • Balance is not normally distributed (p-value 5.1498210695739964e-23)
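
The large share of zero balances explains several of the figures above: Q1 and the 5th percentile are 0, so the IQR equals Q3, while the median is well above the mean. The sketch below works this through using only the reported numbers. The mean over non-zero balances is a derived value that the report itself does not state.

```python
# Figures copied from the Balance tables above.
n_rows = 10_000
n_zeros = 3_617
mean_all = 76485.8893
std_all = 62397.4052

zero_share = n_zeros / n_rows

# Balance has no negative values, so the zeros are the smallest values and
# any quantile level below the zero share falls inside the block of zeros.
q1_is_zero = 0.25 < zero_share
median_is_zero = 0.50 < zero_share

# The zeros add nothing to the sum, so the mean over non-zero balances
# is the total divided by the number of non-zero rows.
nonzero_mean = mean_all * n_rows / (n_rows - n_zeros)

cv = std_all / mean_all  # coefficient of variation
print(zero_share, q1_is_zero, median_is_zero, round(nonzero_mean, 2), round(cv, 4))
```

Because more than a quarter but less than half of the rows are zero, Q1 is 0 while the median is not. The negative kurtosis also fits a distribution split between a spike at zero and a separate bulk of positive balances.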

NumOfProducts

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 644.5 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 3
4th row 2
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10000
  • The top 2 categories (1, 2) take over 50.0%
  • NumOfProducts has words of constant length

HasCrCard

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 644.5 KB
  • The largest value (1) is over 2.4 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 1
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10000
  • The top 2 categories (1, 0) take over 50.0%
  • HasCrCard has words of constant length

IsActiveMember

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 644.5 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10000
  • The top 2 categories (1, 0) take over 50.0%
  • IsActiveMember has words of constant length

EstimatedSalary

numerical

Approximate Distinct Count 9999
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 156.2 KB
Mean 100090.2399
Minimum 11.58
Maximum 199992.48
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • EstimatedSalary is skewed right (γ1 = 0.0021)

Quantile Statistics

Minimum 11.58
5-th Percentile 9851.8185
Q1 51002.11
Median 100193.915
Q3 149388.2475
95-th Percentile 190155.3755
Maximum 199992.48
Range 199980.9
IQR 98386.1375

Descriptive Statistics

Mean 100090.2399
Standard Deviation 57510.4928
Variance 3.3075×10⁹
Sum 1.0009×10⁹
Skewness 0.002085
Kurtosis -1.1815
Coefficient of Variation 0.5746

Exited

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 644.5 KB
  • The largest value (0) is over 3.91 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10000
  • The top 2 categories (0, 1) take over 50.0%
  • Exited has words of constant length
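
For a two-valued column, the "largest over second largest" ratio reported above fixes the class shares. If the larger class is r times the smaller, the smaller share is 1 / (1 + r). The sketch below applies this to Exited. The helper name `shares_from_ratio` is hypothetical, and the result is approximate because the report gives the ratio only as "over 3.91".

```python
def shares_from_ratio(ratio):
    """Return (larger, smaller) class shares for a two-class column,
    given ratio = larger count / smaller count."""
    smaller = 1.0 / (1.0 + ratio)
    return 1.0 - smaller, smaller

# Ratio copied from the Exited insight above (reported as "over 3.91").
majority, minority = shares_from_ratio(3.91)
print(f"Exited = 0: about {majority:.1%}, Exited = 1: about {minority:.1%}")
```

The same arithmetic applies to the other two-valued columns with a reported ratio, such as HasCrCard. It does not apply to Geography, which has three values.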

Interactions

Correlations

Missing Values